BIOTECHNOLOGY
SYSTEMS
BRANCH



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information
Center (STIC) detected errors when processing the following computer readable
form:

[Stamp: RECEIVED MAR 05 2001, TECH CENTER 1600/2900]

Application Serial Number: [illegible]
Source: [illegible]
Date Processed by STIC: __



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A
NOTICE TO COMPLY

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax)
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER
VERSION 3.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW:



The Checker Version 3.0 application is a state-of-the-art Windows-based software program
employing a logical and intuitive user interface to check whether a sequence listing is in
compliance with format and content rules. Checker Version 3.0 works for sequence listings
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old
rules) and the revised version (new rules) effective July 1, 1998, as well as World Intellectual
Property Organization (WIPO) Standard ST.25.

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is
Y2K-compliant. Checker allows public users to check sequence listings in Computer Readable Form
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO).
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence
listings, thus saving time and money.



Checker Version 3.0 can be downloaded from the USPTO website at the following address:

http://www.uspto.gov/web/offices/pac/checker



Sequence Listing Error Summary



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER: 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE



Wrapped Nucleics
  The number/text at the end of each line "wrapped" down to the next line.
  This may occur if your file was retrieved in a word processor after creating it.
  Please adjust your right margin to .3, as this will prevent "wrapping".

Wrapped Aminos
  The amino acid number/text at the end of each line "wrapped" down to the next line.
  This may occur if your file was retrieved in a word processor after creating it.
  Please adjust your right margin to .3, as this will prevent "wrapping".

Incorrect Line Length
  The rules require that a line not exceed 72 characters in length. This includes spaces.
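The 72-character limit stated above can be checked mechanically before a listing is submitted. The following is a minimal sketch, not the Checker program itself. It also flags tabs and non-ASCII characters, which this report names as error sources elsewhere. The function name `find_format_problems` and the sample text are hypothetical and chosen only for illustration.

```python
MAX_LINE_LENGTH = 72  # limit quoted in this report; spaces count toward it


def find_format_problems(text: str) -> list[tuple[int, str]]:
    """Return (line_number, problem) pairs for a sequence listing given as text."""
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > MAX_LINE_LENGTH:
            problems.append((number, f"line is {len(line)} characters long"))
        if "\t" in line:
            problems.append((number, "line contains a tab"))
        if not line.isascii():
            problems.append((number, "line contains non-ASCII characters"))
    return problems


# Hypothetical sample: one overlong line, one tab, one non-ASCII character.
sample = "\n".join([
    "<210> 1",
    "a" * 73,
    "Met\tPhe",
    "Antirrhinum maj\u00fcs",
])

for number, problem in find_format_problems(sample):
    print(f"line {number}: {problem}")
```

Running the check on the file exactly as it will be written to disk matters, because a word processor may re-wrap or re-encode the text after it was created.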



Misaligned Amino Acid Numbering
  The numbering under each 5th amino acid is misaligned. This may be caused by the use of tabs
  between the numbering. It is recommended to delete any tabs and use spacing between the numbers.

Non-ASCII
  This file was not saved in ASCII (DOS) text, as required by the Sequence Rules.
  Please ensure your subsequent submission is saved in ASCII text so that it can be processed.

Variable Length
  Sequence(s) contain n's or Xaa's which represented more than one residue.
  As per the rules, each n or Xaa can only represent a single residue.
  Please present the maximum number of each residue having variable length and
  indicate in the (ix) feature section that some may be missing.

PatentIn ver. 2.0 "bug"
  A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
  sequence(s) __. Normally, PatentIn would automatically generate this section from the
  previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section
  to the subsequent amino acid sequence. This applies primarily to the mandatory <220>-<223>
  sections for Artificial or Unknown sequences.

Skipped Sequences (OLD RULES)
  Sequence(s) missing. If intentional, please use the following format for each skipped sequence:

    (2) INFORMATION FOR SEQ ID NO:X:
    (i) SEQUENCE CHARACTERISTICS: (Do not insert any headings under "SEQUENCE CHARACTERISTICS")
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X:
    This sequence is intentionally skipped

  Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequence(s).

Skipped Sequences (NEW RULES)
  Sequence(s) missing. If intentional, please use the following format for each skipped sequence.

    <210> sequence id number
    <400> sequence id number
    000

Use of n's or Xaa's (NEW RULES)
  Use of n's and/or Xaa's has been detected in the Sequence Listing.
  Use of <220> to <223> is MANDATORY if n's or Xaa's are present.
  In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.
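The rule that every sequence containing n or Xaa must carry a <220> to <223> explanation lends itself to an automated pre-check. The sketch below is an illustration only. It assumes a new-rules listing in which each sequence starts at its `<210>` identifier and the residues follow `<400>`. The function name `sequences_missing_feature` and the two sample sequences are hypothetical.

```python
import re


def sequences_missing_feature(listing: str) -> list[str]:
    """Return SEQ ID numbers that use n/Xaa but lack a <220>/<223> explanation."""
    missing = []
    # Each sequence block starts at its <210> identifier.
    for block in re.split(r"(?=<210>)", listing):
        id_match = re.match(r"<210>[^\d\n]*(\d+)", block)
        if not id_match:
            continue  # text before the first sequence, nothing to check
        header, _, body = block.partition("<400>")
        is_protein = re.search(r"<212>\s*(?:TYPE:)?\s*PRT", header) is not None
        # Drop position numbers and the sequence number after <400>.
        tokens = [t for t in body.split() if not t.isdigit()]
        if is_protein:
            uses_unknown = "Xaa" in tokens
        else:
            # Nucleotides are lower case; this skips translated residues such as "Asn".
            uses_unknown = any(t.islower() and "n" in t for t in tokens)
        explained = "<220>" in header and "<223>" in header
        if uses_unknown and not explained:
            missing.append(id_match.group(1))
    return missing


# Hypothetical listing: SEQ 1 explains its Xaa, SEQ 2 does not.
sample = """<210> 1
<211> 6
<212> PRT
<213> Antirrhinum majus
<220>
<223> Xaa at position 3 is any amino acid
<400> 1
His Ala Xaa Cys Asn Glu
1               5

<210> 2
<211> 6
<212> PRT
<213> Antirrhinum majus
<400> 2
His Ala Xaa Cys Asn Glu
1               5
"""

print(sequences_missing_feature(sample))
```

The check only confirms that the fields are present. Whether the <223> text actually states the location of the n or Xaa and the residue it represents still has to be reviewed by hand.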



Use of <213> Organism (NEW RULES)
  Sequence(s) __ are missing this mandatory field or its response.

Use of <220> Feature (NEW RULES)
  Sequence(s) __ are missing the <220> Feature and associated headings.
  Use of <220> to <223> is MANDATORY if <213> ORGANISM is "Artificial" or "Unknown".
  Please explain source of genetic material in <220> to <223> section.
  (See "Federal Register," 6/01/98, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of new Rules)

PatentIn ver. 2.0 "bug"
  Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted
  file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw
  sequence listing). Instead, please use "File Manager" or any other means to copy file to floppy disk.
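The <213> and <220> requirements above can be expressed as a small check: every sequence needs a non-empty <213> response, and an "Artificial" or "Unknown" organism additionally needs a <220> to <223> section. The following is a sketch under stated assumptions rather than the PTO's own validation logic. The function name `check_organism_fields` and the set `SPECIAL_ORGANISMS` are hypothetical. The sample text is the SEQ ID NO: 10 fragment that appears later in this report.

```python
import re

# Organism responses that trigger the mandatory <220>-<223> explanation (assumed spellings).
SPECIAL_ORGANISMS = {"artificial", "artificial sequence", "unknown"}


def check_organism_fields(listing: str) -> list[tuple[str, str]]:
    """Return (seq_id, problem) pairs for <213>/<220> rule violations."""
    problems = []
    for block in re.split(r"(?=<210>)", listing):
        id_match = re.match(r"<210>[^\d\n]*(\d+)", block)
        if not id_match:
            continue  # text before the first sequence
        seq_id = id_match.group(1)
        header = block.partition("<400>")[0]
        # Accept both bare tags and tags followed by the "ORGANISM:" header.
        organism = re.search(r"<213>[ \t]*(?:ORGANISM:)?[ \t]*(.*)", header)
        if organism is None or not organism.group(1).strip():
            problems.append((seq_id, "<213> missing or empty"))
            continue
        name = organism.group(1).strip()
        has_feature = "<220>" in header and "<223>" in header
        if name.lower() in SPECIAL_ORGANISMS and not has_feature:
            problems.append((seq_id, "<220>-<223> required for " + name))
    return problems


sample = """<210> 10
<211> 6
<212> PRT
<213> Artificial Sequence

<400> 10

His Ala Val Cys Asn Glu
1               5
"""

print(check_organism_fields(sample))
```

For this fragment the check reports SEQ ID NO: 10, which agrees with the "Mandatory Feature missing" entries in the verification summary at the end of this report.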



RAW SEQUENCE LISTING                     DATE: 02/27/2001

PATENT APPLICATION: US/09/446,089A       TIME: 14:31:44

Input Set : A:\001560-377.ST25.txt
Output Set: N:\CRF-3\02272001\I446089A.raw 



[Handwritten note, partly illegible: "Corrected Diskette"]

3 <110> APPLICANT: SAKAKIBARA, Keiko
4 FUKUI, Yuko
5 TANAKA, Yoshikazu
6 KUSUMI, Takaaki
7 MIZUTANI, Masako
8 NAKAYAMA, Toru
10 <120> TITLE OF INVENTION: GENE ENCODING PROTEIN HAVING AURONE SYNTHESIZING ACTIVITY
12 <130> FILE REFERENCE: 001560-377
14 <140> CURRENT APPLICATION NUMBER: US 09/446,089A
15 <141> CURRENT FILING DATE: 1999-12-17
17 <150> PRIOR APPLICATION NUMBER: PCT/JP99/02045
18 <151> PRIOR FILING DATE: 1999-04-16
20 <150> PRIOR APPLICATION NUMBER: JP 10/107296
21 <151> PRIOR FILING DATE: 1998-04-17
23 <160> NUMBER OF SEQ ID NOS: 15
25 <170> SOFTWARE: PatentIn version 3.0
27 <210> SEQ ID NO: 1
28 <211> LENGTH: 1951
29 <212> TYPE: DNA
30 <213> ORGANISM: Antirrhinum majus
32 <220> FEATURE:
33 <221> NAME/KEY: CDS
34 <222> LOCATION: (96)..(1781)
36 <400> SEQUENCE: 1

37 aaattacatt gcttcctttg tcccaccttc caccaccaat atatacaact tcctcagcta 60
39 gttgtttatt atcaatcaaa taaaattatt tccca atg ttc aaa aat cct aat 113
40 Met Phe Lys Asn Pro Asn
41 1 5
43 atc cgc tat cac aaa cta tct tcc aaa tcc aat gac aac gat caa gaa 161
44 Ile Arg Tyr His Lys Leu Ser Ser Lys Ser Asn Asp Asn Asp Gln Glu
45 10 15 20
47 tcc tcc cat cgt tgt aag cac att cta tta ttt ata ata acc tta ttc 209
48 Ser Ser His Arg Cys Lys His Ile Leu Leu Phe Ile Ile Thr Leu Phe
49 25 30 35
51 cta ctt ata gtt ggc ctg tac atc gcc aac tct ctc gcc tat gcc cgg 257
52 Leu Leu Ile Val Gly Leu Tyr Ile Ala Asn Ser Leu Ala Tyr Ala Arg
53 40 45 50
55 ttt gcc tcg acc tca acc ggc cct atc gcc gcc cct gat gtc acc aaa 305
56 Phe Ala Ser Thr Ser Thr Gly Pro Ile Ala Ala Pro Asp Val Thr Lys
57 55 60 65 70
59 tgt ggt cag cca gac ttg cca cct ggc aca gcc cca ata aac tgt tgt 353
60 Cys Gly Gln Pro Asp Leu Pro Pro Gly Thr Ala Pro Ile Asn Cys Cys
61 75 80 85
63 ccc cca atc ccc gct aaa atc atc gat ttc gag cta cca cct ccc tcc 401
64 Pro Pro Ile Pro Ala Lys Ile Ile Asp Phe Glu Leu Pro Pro Pro Ser
65 90 95 100
67 act acc atg agg gtt cgc cgt gcg gct cat tta gtt gat gat gca tac 449
[...] Pro Pro Ile Pro Ala Lys Ile Ile Asp Phe
218 85 90 95
221 Glu Leu Pro Pro Pro Ser Thr Thr Met Arg Val Arg Arg Ala Ala His
222 100 105 110
225 Leu Val Asp Asp Ala Tyr Ile Ala Lys Phe Lys Lys Ala Val Glu Leu
226 115 120 125
229 Met Arg Ala Leu Pro Glu Asp Asp Pro Arg Ser Phe Lys Gln Gln Ala
230 130 135 140
233 Asn Val His Cys Ala Tyr Cys Ala Gly Ala Tyr Asn Gln Ala Gly Phe
234 145 150 155 160
237 Thr Asn Leu Lys Leu Gln Ile His Arg Ser Trp Leu Phe Phe Pro Phe
238 165 170 175
241 His Arg Tyr Tyr Ile Tyr Phe Phe Glu Arg Ile Leu Gly Lys Leu Ile
242 180 185 190
245 Asn Asp Thr Thr Phe Ala Leu Gln Phe Trp Asn Tyr Asp Ser Pro Gly
246 195 200 205
249 Gly Met Thr Ile Pro Ser Met Phe Ile Asp Thr Asn Ser Ser Leu Tyr
250 210 215 220
253 Asp Ser Leu Arg Asp Ser Asn His Gln Pro Pro Thr Ile Val Asp Leu
254 225 230 235 240
257 Asn Tyr Ala Phe Ser Asp Ser Asp Asn Thr Thr Thr Pro Glu Glu Gln
258 245 250 255
261 Met Ile Ile Asn Leu Lys Ile Val Tyr Arg Gln Met Val Ser Ser Ala
262 260 265 270
265 Lys Thr Pro Gln Leu Phe Phe Gly Arg Pro Tyr Arg Arg Gly Asp Gln
266 275 280 285
269 Glu Phe Pro Gly Val Gly Ser Ile Glu Leu Val Pro His Gly Met Ile
270 290 295 300
273 His Leu Trp Thr Gly Ser Glu Asn Thr Pro Tyr Gly Glu Asn Met Gly
274 305 310 315 320
277 Ala Phe Tyr Ser Thr Ala Arg [illegible] Pro Ile Phe Phe Ala His His Ser
278 325 330 335
281 Asn Val Asp Arg Met Trp Ser Ile Trp Lys Thr Leu Gly Gly Pro Arg
282 340 345 350
285 Arg Thr Asp Leu Thr Asp Pro Asp Phe Leu Asp Ala Ser Phe Val Phe
286 355 360 365
289 Tyr Asp Glu Asn Ala Glu Met Val Arg Val Lys Val Arg Asp Cys Leu
290 370 375 380
293 Asp Glu Lys Lys Leu Gly Tyr Val Tyr Gln Asp Val Glu Ile Pro Trp
294 385 390 395 400
297 Leu Asn Thr Arg Pro Thr Pro Lys Val Ser Pro Ser Leu Leu Lys Lys
298 405 410 415
301 Phe Plis Arg Thr Asn Thr Ala Asn Pro Arg Gln Val Phe Pro Ala Ile
302 420 425 430
305 Leu Asp Arg Val Leu Lys Val Ile Val Thr Arg Pro Lys Lys Thr Arg
306 435 440 445
309 Ser Arg Lys Glu Lys Asp Glu Leu Glu Glu Ile Leu Val Ile Glu Gly
310 450 455 460
313 Ile Glu Leu Glu Arg Asp His Gly His Val Lys Phe Asp Val Tyr Ile
314 465 470 475 480
317 Asn Ala Asp Glu Asp Asp Leu Ala Val Ile Ser Pro Glu Asn Ala Glu
318 485 490 495
321 Phe Ala Gly Ser Phe Val Ser Leu Trp His Lys Pro Ile Lys Gly Lys
322 500 505 510
325 Arg Thr Lys Thr Gln Leu Leu Thr Leu Ser Ile Cys Asp Ile Leu Glu
326 515 520 525
329 Asp Leu Asp Ala Asp Glu Asp Asp Tyr Val Leu Val Thr Leu Val Pro
330 530 535 540
333 Arg Asn Ala Gly Asp Ala Ile Lys Ile His Asn Val Lys Ile Glu Leu
334 545 550 555 560
337 Asp Gly

341 <210> SEQ ID NO: 3
342 <211> LENGTH: 13
343 <212> TYPE: PRT
344 <213> ORGANISM: Antirrhinum majus
345 <400> SEQUENCE: 3
348 Lys Lys Leu Gly Tyr Val Tyr Gln Asp Val Glu Ile Pro
349 1 5 10

352 <210> SEQ ID NO: 4
353 <211> LENGTH: 12
354 <212> TYPE: PRT
355 <213> ORGANISM: Antirrhinum majus
357 <400> SEQUENCE: 4
359 Lys Ile Val Tyr Arg Gln Met Val Ser Ser Ala Lys
360 1 5 10

362 <210> SEQ ID NO: 5
363 <211> LENGTH: 18
364 <212> TYPE: PRT
365 <213> ORGANISM: Antirrhinum majus
367 <400> SEQUENCE: 5
369 Lys Thr Pro Gln Leu Phe Phe Gly Arg Pro Tyr Arg Arg Gly Asp Gln
370 1 5 10 15
372 Glu Phe

375 <210> SEQ ID NO: 6
376 <211> LENGTH: 29
377 <212> TYPE: PRT
378 <213> ORGANISM: Antirrhinum majus
380 <220> FEATURE:
381 <221> NAME/KEY: UNSURE
382 <222> LOCATION: (9)..(9)
383 <223> OTHER INFORMATION: Amino acid 9 is Xaa wherein Xaa = unknown or other.
385 <220> FEATURE:
<210> 10
<211> 6
<212> PRT
<213> Artificial Sequence

<400> 10

His Ala Val Cys Asn Glu
1               5



Please Note: 

Use of n and/or Xaa has been detected in the Sequence Listing. Please review the
Sequence Listing to ensure that a corresponding explanation is presented in the <220> to
<223> fields of each sequence which presents at least one n or Xaa.




VERIFICATION SUMMARY                     DATE: 02/27/2001

PATENT APPLICATION: US/09/446,089A       TIME: 14:31:45

Input Set : A:\001560-377.ST25.txt
Output Set: N:\CRF3\02272001\I446089A.raw



L:392 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:6
L:395 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:6
L:442 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:458 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:9
L:466 M:258 W: Mandatory Feature missing, <220> FEATURE:
L:466 M:258 W: Mandatory Feature missing, <223> OTHER INFORMATION:
L:484 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:11
L:499 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:12
L:514 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13
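
The summary entries can be grouped by listing line so that each flagged line is corrected once. The sketch below is an illustration only. It assumes each entry is written on one line as `L:<number> M:<number> W: <message>`, and it reads `L` as the line number in the raw listing, `M` as a message code and `W` as a warning marker. These readings are assumptions and are not stated in the report. The function name `group_by_listing_line` is hypothetical, and the sample reuses three entries from the summary.

```python
import re
from collections import defaultdict

# One summary entry per line, in the assumed form "L:466 M:258 W: message".
SUMMARY_LINE = re.compile(r"L:(\d+)\s+M:(\d+)\s+(\w):\s*(.*)")


def group_by_listing_line(summary: str) -> dict[int, list[tuple[int, str, str]]]:
    """Map each raw-listing line number to its (code, severity, message) entries."""
    grouped = defaultdict(list)
    for line in summary.splitlines():
        match = SUMMARY_LINE.match(line.strip())
        if match:
            listing_line, code, severity, message = match.groups()
            grouped[int(listing_line)].append((int(code), severity, message))
    return dict(grouped)


sample = "\n".join([
    'L:466 M:258 W: Mandatory Feature missing, <220> FEATURE:',
    'L:466 M:258 W: Mandatory Feature missing, <223> OTHER INFORMATION:',
    'L:484 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:11',
])

for listing_line, entries in sorted(group_by_listing_line(sample).items()):
    print(listing_line, len(entries))
```

Grouping shows, for example, that line 466 carries two entries that both stem from the same missing feature section.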